Thermodynamics of Peptide Aggregation Processes. 
An Analysis from Perspectives of Three Statistical Ensembles 
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We employ a mesoscopic model for studying aggregation processes of protein-like hydrophobic- 
polar heteropolymers. By means of multicanonical Monte Carlo computer simulations, we find strong 
indications that peptide aggregation is a phase separation process, in which the microcanonical 
entropy exhibits a convex intruder due to nonnegligible surface effects of the small systems. We 
analyze thermodynamic properties of the conformational transitions accompanying the aggregation 
process from the multicanonical, canonical, and microcanonical perspective. It turns out that the 
microcanonical description is particularly advantageous as it allows for unraveling details of the 
phase-separation transition in the thermodynamic region, where the temperature is not a suitable 
external control parameter anymore. 
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I. INTRODUCTION 

Beside receptor-ligand binding mechanisms, folding 
and aggregation of proteins belong to the biologically 
most relevant molecular structure formation processes. 
While the specific binding between receptors and lig- 
ands is not necessarily accompanied by global coopera- 
tive structural changes, protein folding and oligomeriza- 
tion of peptides are typically accompanied by conforma- 
tional transitions Proteins and their aggregates are 
comparatively small systems. A typical protein consists 
of a sequence of some hundred amino acids and aggre- 
gates are often formed by only a few peptides. A very 
prominent example is the extracellular aggregation of the 
A/3 peptide, which is associated with Alzheimer's disease. 
Following the amyloid hypothesis, it is believed that these 
aggregates (which can also take fibrillar forms) are neu- 
rotoxic, i.e., they are able to fuse into cell membranes 
of neurons and open calcium ion channels. It is known 
that extracellular Ca 2+ ions intruding into a neuron can 
promote its degeneration @, H, 0] . 

Conformational transitions proteins experience dur- 
ing structuring and aggregation are not phase transi- 
tions in the strict thermodynamic sense and their sta- 
tistical analysis is usually based on studies of signals ex- 
posed by energetic and structural fluctuations, as well 
as system-specific "order" parameters. In these stud- 
ies, the temperature T is considered as an adjustable, 
external control parameter and, for the analysis of the 
pseudophase transitions, the peak structure of quanti- 
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ties such as the specific heat and the fluctuations of 
the gyration tensor components or "order" parameter 
as functions of the temperature are investigated. The 
natural ensemble for this kind of analysis is the canon- 
ical ensemble, where the possible states of the system 
with energies E are distributed according to the Boltz- 
mann probability exp(— E/ksT), where Ub is the Boltz- 
mann constant. However, phase separation processes of 
small systems as, e.g., droplet condensation, are accom- 
panied by surface effects at the interface between the 
pseudophases @, i, 0, S S M HO, El El M El • This 
is reflected by the behavior of the microcanonical en- 
tropy S(E), which exhibits a convex monotony in the 
transition region. Consequences are the backbending of 
the caloric temperature T(E) = (dS/dE)^ 1 , i.e., the 
decrease of temperature with increasing system energy, 
and the negativity of the microcanonical specific heat 
Cv(E) = (dT(E)/dE)- 1 = -{dS/dE) 2 /{d 2 S/dE 2 ). 
The physical reason is that the free energy balance in 
phase equilibrium requires the minimization of the inter- 
facial surface and, therefore, the loss of entropy [jjj]. A 
reduction of the entropy can, however, only be achieved 
by transferring energy into the system. Recently, we 
have shown that, employing a minimalistic hctcropoly- 
mer model, the aggregation of two small peptides is such 
a phase separation process, where we observed the men- 
tioned peculiar small-system effects [l7j- Here, we con- 
sider the aggregation process from the multicanonical, 
canonical, and microcanonical perspectives. Our results 
were obtained from multicanonical computer simulations 
of a mesoscopic hydrophobic-polar heteropolymer model 
for aggregation, which is based on a simple off-lattice 
model, originally introduced to study tertiary folding of 
proteins from a coarse-grained point of view. 

The paper is organized as follows. In Sect. [Til we define 
the aggregation model employed in our computational 
study, where we primarily used multicanonical sampling. 
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This method is also briefly described here as well as the 
aggregation "order" parameter needed to discriminate 
the pseudophases. Section IIIII is devoted to the main 
part of the paper: The presentation of the results for the 
aggregation of two small peptides obtained from mul- 
ticanonical, canonical, and microcanonical views. The 
comparison with results obtained for larger systems is 
performed in Sect. HVl The paper is concluded by a sum- 
mary of the results in Sect. [Vl 



II. MODEL AND METHODS 

For our aggregation study on mesoscopic scales, we em- 
ploy a novel model that is based on a known hydrophobic- 
polar single-chain approach, originally introduced for 
heteropolymer chains in two dimensions jl8j |. In this 
section, we define this model, describe the simulation 
methods, and introduce a suitable order parameter that 
allows for the discrimination of the macrostates or "pseu- 
dophases" the multiple-chain system can reside in. 



Mesoscopic hydrophobic-polar aggregation 
model 



For our aggregation study of protein-like heteropoly- 
mers, we assume that the tertiary folding process of the 
individual chains is governed by hydrophobic-core forma- 
tion in an aqueous environment. A comparatively sim- 
ple but powerful model is the AB model [18[ where only 
two types of amino acids are considered: hydrophobic 
residues (A) which avoid contact with the polar environ- 
ment and polar residues (B) being favorably attracted 
by the solvent. The model is a C" type model in that 
each residue is represented by only a single interaction 
site (the "C" atom"). Thus, the natural dihedral tor- 
sional degrees of freedom of realistic protein backbones 
are replaced by virtual bond and torsion angles between 
consecutive interaction sites. The large torsional barrier 
of the peptide bond between neighboring amino acids 
is in the AB model effectively taken into account by 
introducing a bending energy. Nonbonded residues ex- 
perience weak pairwise long-range attraction (AA and 
BB pairs) or repulsion (AB pairs), respectively. Al- 
though this coarse-grained picture is obviously not capa- 
ble to reproduce microscopic properties of specific realis- 
tic proteins, it qualitatively exhibits, however, sequence- 
dependent features known from nature, as, for example, 
tertiary folding pathways known from two-state folding, 
folding through intermediates, and metastability jl9j |. 

For our systems of more than one chain, we further as- 
sume that the interaction strengths between nonbonded 
residues is independent of the individual properties of 
the chains the residues belong to. Therefore, we use the 
same parameter sets as in the AB model for the pairwise 
interactions between residues of different chains. Our ag- 



gregation model reads [TtJ 

«=E^ + EE ; ct v > ^ )> (i) 

where /i, v label the M polymers interacting with each 
other, and i^, j v index the iV Mj „ monomers of the respec- 
tive juth and vth polymer. The intrinsic single-chain en- 
ergy is given by 

4b = jE( 1 - cos ^)+ E *(*w°*>°-4.). ( 2 ) 

V j M >V + 1 



with < di < it denoting the bending angle between 
monomers i^, i^ + l, ■ 
residue pair potential 



monomers i^, + 1, and i M + 2. The nonbonded inter- 
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depends on the distance r% j u between the residues, and 
on their type, ov = A,B. The long-range behavior 
is attractive for like pairs of residues [C(A,A) = 1, 
C(B,B) = 0.5] and repulsive otherwise [C(A, B) = 
C(B,A) = -0.5]. The lengths of all virtual peptide 
bonds are set to unity. 

In this paper, we report on results obtained from 
statistical mechanics studies of the aggregation pro- 
cesses of short polymers. Our primary interest is 
devoted to the heteropolymer with the sequence SI: 



AB2AB2ABAB2AB which is a Fibonacci sequence 18 1, 
whose single-chain properties are already known 2C | . 
Throughout the paper, we are going to study the thermo- 
dynamics of systems with up to 4 chains of this sequence 
over the whole energy and temperature regime. 



B. Simulation methods 

We have used generalized-ensemble Markovian Monte 
Carlo algorithms to sample the conformational space of 
the systems studied. The powerful error-weighted multi- 
canonical method 0, [22|, HH proved to be particularly 
useful as it makes it possible to scan the whole phase 
space with very high accuracy [2C| . The principle idea is 
to deform the Boltzmann energy distribution 



(E;T)cxg(E)eM-E/k B T), 



(4) 



where g(E) is the density of states with energy E and 
ksT is the thermal energy at temperature T, in such a 
way that the notoriously difficult sampling of the tails is 
increased and - particularly useful - the sampling rate of 
the entropically strongly suppressed lowest-energy con- 
formations is improved. In order to achieve this, the 
canonical Boltzmann distribution is modified by the mul- 
ticanonical weight W muca (E;T) which, in the ideal case, 
flattens the energy distribution: 

Pmuca(£) = W muca (E; T)p can (E; T) = const B;T . (5) 
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As the canonical distribution is, of course, not known in 
the beginning and W muca (E;T) ~ p~^(E\T), the mul- 
ticanonical weights have to be determined recursively, 
which can be done in an efficient way [HI, [24[ . Recalling 
that the simulation temperature T does not possess any 
meaning in the multicanonical ensemble as, according to 
Eq. l[5]). the energy distribution is always constant, inde- 
pendently of temperature. Actually, it is convenient to 
set it to infinity in which case limj^oo p ca , n (E; T) ~ g(E) 
and thus lim^ W mnca (£;T) ~ g^iE). The latter 
expression is sometimes parametrized as W muca (E) ~ 
exp[—j3(E)E+a(E)], where, forasuitable choice of a(E), 
(3{E) can be identified with the microcanonical temper- 
ature m. 

In our simulations, conformational changes of the indi- 
vidual chains included spherical updates [201 ] and semilo- 
cal crankshaft moves, i.e., rotations around the axis be- 
tween the nth and (n + 2)th residue. A typical multi- 
canonical run contained of the order of 10 10 single up- 
dates. The polymer chains were embedded into a cubic 
box with edge lengths L and periodic boundary condi- 
tions were used. In our simulations, the edge lengths of 
the simulation box were chosen to be L — 40 which is 
sufficient to reduce undesired finite-size effects. 

For cross-checks we have also performed replica- 
exchange (parallel tempering) simulations [ID, [26[ . Ver- 
ifying lowest-energy conformations found in the mul- 
ticanonical simulations, we have also performed opti- 
mization runs using the energy-landscape paving (ELP) 
method Wh. 



"Order" parameter of aggregation and 
fluctuations 



In order to distinguish between the fragmented and the 
aggregated regime, we introduce the "order" parameter 
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According to the Boltzmann distribution |QJ , we define 
canonical expectation values of any observable O by 



(0)(T) 



^can(T) 1^ 
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-E({X„})/k B T 
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where the canonical partition function Z can is given by 
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-B({X M })/fe s T 



(8) 



Formally, the integrations are performed over all possible 
conformations of the M chains. 

Similarly to the specific heat per monomer cy(T) = 
d{E)/N tot dT = ({E 2 ) - (E) 2 )/N tot k B T 2 (with A tot = 
^m) wmcn expresses the thermal fluctuations of 
energy, the temperature derivative of (r) per monomer, 
d(T)/N tot dT = ((TE) - (T)(E))/N tot k B T 2 , is a useful 
indicator for cooperative behavior of the multiple-chain 
system. Since the system size is small - the number of 
monomers N to t as well as the number of chains M - 
aggregation transitions, if any, are expected to be sig- 
nalized by the peak structure of the fluctuating quanti- 
ties as functions of the temperature. This requires the 
temperature to be a unique external control parameter 
which is a natural choice in the canonical statistical en- 
semble. Furthermore, this is a typically easily adjustable 
and, therefore, convenient parameter in experiments. As 
we have stressed recently [1_7[, however, aggregation is a 
phase separation process and, since the system is small, 
there is no uniform mapping between temperature and 
energy. For this reason, the total system energy is the 
more appropriate external parameter. Thus, the micro- 
canonical interpretation will turn out to be the more fa- 
vorable description, at least in the transition region. We 
will discuss this in detail in the following section. 



where the summations are taken over the minimum dis- 
tances d pcr = (dper, rfpor, dper) of the respective cen- 
ters of mass of the chains (or their periodic continua- 
tions). The center of mass of the /^th chain in a box 
with periodic boundary conditions is defined as i"com,^ = 

Ei M =i [ d por (rv,rij + rij /A M , where r 1(J is the coor- 
dinate vector of the first monomer and serves as a refer- 
ence coordinate in a local coordinate system. 

Our aggregation parameter is to be considered as a 
qualitative measure; roughly, fragmentation corresponds 
to large values of T, aggregation requires the centers of 
masses to be in close distance in which case T is compar- 
atively small. Despite its qualitative nature, it turns out 
to be a surprisingly manifest indicator for the aggrega- 
tion transition and allows even a clear discrimination of 
different aggregation pathways, as will be seen later on. 



III. STATISTICS OF THE TWO-CHAIN 
HETEROPOLYMER SYSTEM IN THREE 
ENSEMBLES 



For the qualitative description of the aggregation and 
the accompanied conformational cooperativity within the 
whole system, it is sufficient to consider a very small 
system which is computationally reliably tractable and 
nonetheless yields precise results for all energies and tem- 
peratures. Our heteropolymer system consists of two 
identical chains with the amino acid composition SI and 
will be denoted as 2xSl. In the following, we discuss 
the aggregation behavior of this system from the multi- 
canonical, the canonical, and the microcanonical point of 
view. 
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Multicanonical results 



(a) 



In a multicanonical simulation, the phase space is sam- 
pled in such a way that the energy distribution gets as 
flat as possible. Thermodynamically, this means that the 
sampling of the phase space is performed for all temper- 
atures within a single simulation [2TI . I22I I23I [24| . The 
desired information for the thermodynamic behavior of 
the system at a certain temperature is then obtained by 
simply reweighting the multicanonical into the respec- 
tive canonical distribution, according to Eq. ([5]). Since 
the multicanonical ensemble contains all thermodynamic 
informations, including the conformational transitions, it 
is quite useful to measure within the simulation the mul- 
ticanonical histogram 



rv 



(9) 



where t muca labels the Monte Carlo "time" steps. 
More formally, this distribution can be expressed as a 
conformation-space integral 
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FIG. 1: (a) Multicanonical histogram log 10 /i mU ca as a func- 
tion of energy E and aggregation parameter T, (b) section of 
log 10 /imuca in the low-energy tail. 



with the multicanonical energy H.muca.{E) = E — 
ksThi W muca (E;T) which is independent of tempera- 
ture. 

The multicanonical partition function is also trivially 
a constant in temperature, 



7 muca — 



e -W muca (E({X,}))/ fcB T = congtT _ 



fill 



It is obvious that integrating h mnca (E, T) over T recovers 
the uniform multicanonical energy distribution: 



dT h„ 



(12) 



The canonical distribution of energy and T parameter 
at temperature T can be retained, similar to inverting 
Eq. l[5]). by performing the simple reweighting 



h can (E,T;T) = h muca (E,T)W-^ ca (E;T), 



(13) 



which is, due to the restriction to a certain tempera- 
ture, less favorable to gain an overall impression of the 
phase behavior (i.e., the transition pathway) of the sys- 
tem, compared to the multicanonical analog h muca (E, T). 

In Fig.QJa), h muca (E, T) is shown for the two-peptide 
system 2xSl as a color-coded projection onto the E-T 



plane, which is the direct output obtained in the multi- 
canonical simulation. Qualitatively, we observe two sep- 
arate main branches (which are "channels" in the cor- 
responding free-energy landscape), between which a no- 
ticeable transition occurs. In the vicinity of the energy 
E SC p ~ —3.15, both channels overlap, i.e., the associ- 
ated macrostates coexist. Since T is an effective mea- 
sure for the spatial distance between the two peptides, 
it is obvious that conformations with separated or frag- 
mented peptides belong to the dominating channel in the 
regime of high energies and large T values, whereas the 
aggregates are accumulated in the narrow low-energy and 
smalhT channel. Thus, the main observation from the 
multicanonical, comprising point of view is that the ag- 
gregation transition is a phase-separation process which, 
even for this small system, already appears in a surpris- 
ingly clear fashion. 

The high precision of the multicanonical method al- 
lows us even to reveal further details in the lowest-energy 
aggregation regime, which is usually a notoriously dif- 
ficult sampling problem. Figure HJb) shows that the 
tight aggregation channel splits into three separate, al- 
most degenerate subchannels at lowest energies. From 
the analysis of the conformations in this region, we find 
that representative conformations with smallest T val- 
ues, r w 0.45, are typically entangled, while those with 
r » 0.8 have a spherically-capped shape. This is the sub- 



entangled 



capped 



FIG. 2: Representatives and schematic characteristics of 
entangled and spherically-capped conformations dominating 
the lowest-energy branches in the multicanonical histogram 
shown in Fig. QJb). Dark spheres correspond to hydrophobic 
(A), light ones to polar (B) residues. 



channel connected to the lowest-energy states. Examples 
are shown in Fig. [2] The also highly compact conforma- 
tions belonging to the intermediate subphase do not ex- 
hibit such characteristic features and are rather globules 
without noticeable internal symmetries. In all cases, the 
aggregates contain a single compact core of hydrophobic 
residues. Thus, the aggregation is not a simple docking 
process of two prefolded peptides, but a complex coopera- 
tive folding-binding process. This is a consequence of the 
energetically favored hydrophobic inter-residue contacts 
which, as the results show, overcompensate the entropic 
steric constraints. The story is, however, even more in- 
teresting, as also nonnegligible surface effects come into 
play. After the following standard canonical analysis, 
this will be discussed in more detail in the subsequent 
microcanonical interpretation of our results. 



B. Canonical perspective 

Phase transitions are typically described in the canoni- 
cal ensemble with the temperature kept fixed. This is also 
natural from an experimentalist's point of view, since the 
temperature is a convenient external control parameter. 
The macrostates are weighted according to the Boltz- 
mann distribution Q. A nice feature of the canonical 
ensemble is that the temperature dependence of fluctua- 
tions of thermodynamic quantities is usually a very useful 
indicator for phase or pseudophase transitions. This co- 
operative thermodynamic activity is typically signalized 
by peaks or, in the thermodynamic limit (if it exists), by 
divergences of these fluctuations. Even for small systems, 
peak temperatures can frequently be identified with tran- 
sition temperatures. Although in these cases peak tem- 
peratures typically depend on the fluctuating quantities 
considered, in most cases associated pseudophase transi- 
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FIG. 3: (a) Mean energy {E)/N to t and specific heat per 
monomer cy, and (b) (T)/Nu>t and d{T) /N to tdT as functions 
of the temperature. 



tions are doubtlessly manifest. In such cases, the transi- 
tion ranges over an extended temperature interval, as, 
e.g., in the folding process of proteins or heteropoly- 
mers [Hj]. 

In our aggregation study of the 2xSl system, however, 
we obtain from the canonical analysis a surprisingly clear 
picture of the aggregation transition. Figure OUa) shows 
the canonical mean energy (E) and the specific heat per 
monomer cy, plotted as functions of the temperature T. 
In Fig. [3)^b) , the temperature dependence of the mean 
aggregation order parameter (r) and the fluctuations of 
r are shown. The aggregation transition is signalized 
by very sharp peaks and from both figures we read off 
peak temperatures close to T agg « 0.20. The aggrega- 
tion of the two peptides is a single-step process, in which 
the formation of the aggregate with a common compact 
hydrophobic core governs the folding behavior of the in- 
dividual chains. Folding and binding are not separate 
processes. 

The dominance of the inter-chain binding interaction 
can also be seen by considering the lowest-energy con- 
formation found in our simulations. The energy of this 
conformation, which is shown in Fig. [H is E m [ n —18.4 
in our energy units. The peptide-peptide binding energy 
[i.e., the second term in Eq. |T])] is with E^g ' min ~ —11.4 
much stronger than the intrinsic single-chain energies 
E 
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AB.min 



-3.2 and E 



(2) 

AS, min 



-3.8, respectively. The 
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FIG. 4: The minimum-energy 2xSl complex with _E m i 
— 18.4 as found in our simulations is a capped aggregate. 



^single 
,min 



-5.0 21 



single-chain minimum energy is with E AB ° 
noticeably smaller. 

The comparatively strong inter-chain interaction and 
the strength of the aggregation transition despite the 
smallness of the system lead to the conclusion that sur- 
face effects are of essential importance for the aggregation 
of the peptides. This is actually confirmed by a detailed 
microcanonical analysis which is performed in the next 
subsection. 
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FIG. 5: Microcanonical Hertz entropy S(E) of the 2xSl sys- 
tem, concave Gibbs hull TLs{E), and inverse caloric temper- 
ature T~ 1 (E) as functions of energy. The phase separation 
regime ranges from E agg to Ef rag . Between T.J 1 and T> , the 
temperature is no suitable external control parameter and the 
canonical interpretation is not useful: The inverse caloric tem- 
perature T~ 1 (E) exhibits an obvious backbending in the tran- 
sition region. Note the second, less-pronounced backbending 
in the energy range E < < E < i?f rag . 



C. Microcanonical interpretation 

In the microcanonical analysis, the system energy E 
is kept (almost) fixed and treated as an external control 
parameter. The system can only take macrostates with 
energies in the interval (E, E + AE) with AE being suffi- 
ciently small to satisfy AG(E) = g(E)AE, where AG{E) 
is the phase space volume of this energetic shell. In the 
limit AE — > 0, the total phase space volume up to the 
energy E can thus be expressed as 

G(E)= I dE'g(E'). (14) 

JE min 

Since g(E) is positive for all E, G(E) is a monotonically 
increasing function and this quantity is suitably related 
to the microcanonical entropy S{E) of the system. In 
the definition of Hertz, 

S(E) = k B \n G(E). (15) 

Alternatively, the entropy is often directly related to the 
density of states g{E) and defined as 

S(E) = k B ^g(E). (16) 

The density of states exhibits a decrease much faster than 
exponential towards the low-energy states. For this rea- 
son, the phase-space volume at energy E is strongly dom- 
inated by the number of states in the energy shell AE. 
Thus G(E) ps AG{E) ~ g(E) is directly related to the 
density of states. This virtual identity breaks down in 
the higher-energy region, where In g(E) is getting flat - 
in our case far above the energetic regions being rele- 
vant for the discussion of the aggregation transition (i.e., 
for energies E 3> -Ef ra g, see Fig.[S]). Actually, both defini- 
tions of the entropy led in our study to virtually identical 



results in the analysis of the aggregation transition [171 ]. 
The (reciprocal) slope of the microcanonical entropy fixes 
the temperature scale and the corresponding caloric tem- 
perature is then defined via T(E) = (dS(E)/dE)~ 1 for 
fixed volume V and particle number iVtot- 

As long as the mapping between the caloric tempera- 
ture T and the system energy E is bijective, the canoni- 
cal analysis of crossover and phase transitions is suitable 
since the temperature can be treated as external con- 
trol parameter. For systems, where this condition is not 
satisfied, however, in a standard canonical analysis one 
may easily miss a physical effect accompanying conden- 
sation processes: Due to surface effects (the formation 
of the contact surface between the peptides requires a 
rearrangement of monomers in the surfaces of the indi- 
vidual peptides), additional energy does not necessarily 
lead to an increase of temperature of the condensate. 
Actually, the aggregate can even become colder. The 
supply of additional energy supports the fragmentation 
of parts of the aggregate, but this is overcompensated by 
cooperative processes of the particles aiming to reduce 
the surface tension. Condensation processes are phase- 
separation processes and as such aggregated and frag- 
mented phases coexist. Since in this phase-separation re- 
gion T and E are not bijective, this phenomenon is called 
the "backbending effect" . The probably most important 
class of systems exhibiting this effect is characterized by 
their smallness and the capability to form aggregates, 
depending on the interaction range. The fact that this 
effect could be indirectly observed in sodium clustering 
experiments [8( gives rise to the hope that backbending 
could also be observed in aggregation processes of small 
peptides. 

Since the 2xSl system apparently belongs to this class, 
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the backbending effect is also observed in the aggrega- 
tion/fragmentation transition of this system. This is 
shown in Fig. [5l where the microcanonical entropy S(E) 
is plotted as function of the system energy. The phase- 
separation region of aggregated and fragmented confor- 
mations lies between E agg w —8.85 and Ef lag w 1.05. 
Constructing the concave Gibbs hull Hs(E) by linearly 
connecting S(E agg ) and S(Ef vag ) (straight dashed line in 
Fig. [5]), the entropic deviation due to surface effects is 
simply AS(E) = H S {E) - S(E). The deviation is max- 
imal for E = E scp and AS(E scp ) = AiS sm f is the sur- 
face entropy. The Gibbs hull also defines the aggregation 
transition temperature 

T am = (dH s {E)/dE)-\ (17) 

For the 2xSl system, we find T agg sa 0.198, which is vir- 
tually identical with the peak temperatures of the fluc- 
tuating quantities discussed in Sect. MI Bl 

The inverse caloric temperature T~ 1 (E) is also plot- 
ted into Fig. [5] For a fixed temperature in the inter- 
val T< < T < T y (T< w 0.169 and T> w 0.231), 
different energetic macrostates coexist. This is a con- 
sequence of the backbending effect. Within the back- 
bending region, the temperature decreases with increas- 
ing system energy. The horizontal line at T~ gg w 5.04 is 
the Maxwell construction, i.e., the slope of the Gibbs hull 
TLs{E). Although the transition seems to have similari- 
ties with the van der Waals description of the condensa- 
tion/evaporation transition of gases - the "overheating" 
of the aggregate between T agg and T> (within the en- 
ergy interval £"a gg < E < E> « —5.13) is as apparent 
as the "undercooling" of the fragments between T< and 
T^g (in the energy interval -Ef rag > E > E < w —1.13) - 
it is important to notice that in contrast to the van der 
Waals picture the backbending effect in-between is a real 
physical effect. Another essential result is that in the 
transition region the temperature is not a suitable ex- 
ternal control parameter: The macrostate of the system 
cannot be adjusted by fixing the temperature. The bet- 
ter choice is the system energy which is unfortunately 
difficult to control in experiments. Another direct conse- 
quence of the energetic ambiguity for a fixed temperature 
between T < and T> is that the canonical interpretation is 
not suitable for detecting the backbending phenomenon. 
It should also be noted that in this region, the micro- 
canonical specific heat cy(E) can become negative [l7| . 
which is a remarkable, but somehow "exotic" side effect. 

The precise microcanonical analysis reveals also a fur- 
ther detail of the aggregation transition. Close to E pl - C w 
—0.32, the T _1 curve in Fig. [5] exhibits another "back- 
bending" which signalizes a second, but unstable tran- 
sition of the same type. The associated transition tem- 
perature T pre « 0.18 is smaller than T agg , but this tran- 
sition occurs in the energetic region where fragmented 
states dominate. Thus this transition can be interpreted 
as the premelting of aggregates by forming intermediate 
states. These intermediate structures are rather weakly 
stable: The population of the premolten aggregates never 
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FIG. 6: Logarithmic plots of the canonical energy histograms 
(not normalized) at T ~ 0.18 and T m 0.20, respectively. 



dominates. In particular, at T prc , where premolten ag- 
gregates and fragments coexist, the population of com- 
pact aggregates is much larger. This can nicely be seen 
in the canonical energy histograms at these tempera- 
tures plotted in Fig. [6l where the second backbending 
is only signalized by a small cusp in the coexistence re- 
gion. Since both transitions are phase-separation pro- 
cesses, structure formation is accompanied by releas- 
ing latent heat which can be defined as the energetic 
widths of the phase coexistence regimes, i.e., AQ agg = 
Efr ag — E agg = T agg [S(Ef rag ) — S(E agg )] « 9.90 and 
AQ pro = £f rag — E pTe = T pTe [S (Ef Tas ) — S(E pie )\ w 1.37. 
Obviously, the energy required to melt the premolten ag- 
gregate is much smaller than to dissolve a compact (solid) 
aggregate. 

For the comparison of the surface entropies, we use the 
definition (fTBj) of the entropy. In the case of the aggrega- 
tion transition, the surface entropy is A<S^ f ss AS^f f = 
H s (E scp ) - S{E scp ), where H S (E) « H S (E) is the con- 
cave Gibbs hull of S(E). Since H s (E scp ) = H s (E h . ag ) - 
(E flag ~E scp )/T agg and H s (E frag ) = S{E hag ), the surface 
entropy is 

^■ S suff = s ( E frag) - S(E scp ) - — — (E irag - E scp ). (18) 

J a gg 

Yet utilizing that the canonical distribution h can (E) at 
T agg shown in Figgis h can (E) ~ g(E) exp(-£ /k B T agg ), 
the surface entropy can be written in the simple and com- 
putationally convenient form [111 ]: 

AS:^ = k B \n H - { ^ ) (19) 

'''can J 

A similar expression is valid for the coexistence of pre- 
molten and fragmented states at T pre The corresponding 
canonical distribution is also shown in Fig. [SJ Thus, we 
obtain (in units of fee) for the surface entropy of the ag- 
gregation transition AS^f f w 2.48 and for the premelting 
AS'gJfc ~ 0.04, confirming the weakness of the interface 
between premolten aggregates and fragmented states. 
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IV. AGGREGATION TRANSITION IN 
LARGER HETEROPOLYMER SYSTEMS 

In order to verify the general validity of the statements 
in the previous section for the 2xSl system, we have 
also performed simulations of systems consisting of three 
(in the following referred to as 3xSl) and four (4xSl) 
identical peptides with sequence SI. 

Although the formation of compact hydrophobic cores 
is more complex in larger compounds of our exemplified 
sequence SI, the aggregation transition is little influenced 
by this. This is nicely seen in Figs.JTJa) and[7£b), where 
the temperature dependence of the canonical expecta- 
tion values of T and E, as well as for their fluctuations, 
are shown for the 3xSl system. For comparison, also 
results for the 4xSl system are plotted into the same 
figures. Note that for the 4xSl system finite-size effects 
are larger since, for computational reasons, we have kept 
the edge length of the simulation box L = 40, which is 
smaller than the successive arrangement of four straight 
chains with 13 monomers. This influences primarily the 
entropy in the high-energy regime far above the aggre- 
gation transition energy. Nonetheless, in the canonical 
interpretation, it acts back onto the transition as unde- 
sired states (chain ends overlapping due to the periodic 
boundary conditions) are (weakly) populated at the tran- 
sition temperature, whereas others are suppressed. We 
have performed a detailed analysis of the box size de- 
pendence (results not shown) and found that the canoni- 
cal transition temperature scales slightly, but noticeably 
with the box size. Thus, the results obtained by canonical 
statistics for the 4xSl system should not quantitatively 
be compared to the canonical results for the 2xSl and 
3xSl systems. 

As has already been discussed for the 2xSl system, 
there are also for the larger systems no obvious signals 
for separate aggregation and hydrophobic-core formation 
processes. Only weak activity in the energy fluctuations 
in the temperature region below the aggregation transi- 
tion temperature indicates that local restructuring pro- 
cesses of little cooperativity (comparable with the discus- 
sion of the premolten aggregates in the discussion of the 
2xSl system) are still happening. The strength of the ag- 
gregation transition is also documented by the fact that 
the peak temperatures of energetic and aggregation pa- 
rameter fluctuations are virtually identical for the 3xSl 
system, i.e., the aggregation temperature is T agg w 0.21 
(for 4xSl T agg w 0.22). 

For homogeneous multiple-chain systems two variants 
of thermodynamic limits are of particular interest: (i) 
M — > oo, while = const, (ii) — > oo with 
M = const; both limits considered for constant poly- 
mer density. Since for proteins the sequence of amino 
acids is fixed, in this case only (i) is relevant and it is 
future work to perform a scaling analysis for multiplc- 
peptide systems in this limit. A particularly interesting 
question is to what extent remnants of the finite-system 
effects, as discussed in this paper, survive in the limit of 
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an infinite number of chains, dependent on the peptide 
density. Since we have focused our study on the pre- 
cise analysis of systems of few peptides for all energies 
and temperatures, it was computationally inevitable to 
restrict ourselves to small systems, for which a scaling 
analysis is not very useful. Nonetheless, we would like 
to devote a few interesting remarks to the comparison 
of, once more, microcanonical aspects of the aggregation 
transition in dependence of the system size. 

In Fig. [8j the microcanonical entropies per monomer 
s(e) = S(e)/N to t (shifted by an unimportant constant 
for clearer visibility) and the corresponding Gibbs hulls 
h s (e) = Hs(e)/N tot are shown for 2xSl (in the figure de- 
noted by "2"), 3xSl ("3"), and 4 x SI ("4"), respectively, 
as functions of the energy per monomer e = E/N to t- 
Although the convex entropic "intruder" is apparent for 
larger systems as well, its relative strength decreases with 
increasing number of chains. The slopes of the respective 
Gibbs constructions determine the aggregation temper- 
ature (H7J) which are found to be T a 3 g x g si « 0.212 and 
T agg si » 0.217 confirming the peak temperatures of the 
fluctuation quantities plotted in Fig. [7] 

The existence of the interfacial boundary entails a 
transition barrier whose strength is characterized by the 
surface entropy AiS sur f. In Fig. [SJ the individual en- 
tropic deviations per monomer, As(e) = AS(e)/N to t are 
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FIG. 8: Microcanonical entropies per monomer s(e), respec- 
tive Gibbs constructions h s (e) (left scale), and deviations 
As(e) = h a (e) — s(e) (right scale) for 2xSl (labeled as 
2), 3xSl (3), and 4xSl (4) as functions of the energy per 
monomer e. 



TABLE I: Aggregation temperatures T agg , surface entropies 
A<Ssu r f, relative surface entropies per monomer As sur f, rel- 
ative aggregation and fragmentation energies per monomer, 
e agg and ef rag , respectively, latent heat per monomer Aq, and 
phase-separation entropy per monomer Ag/Tagg. All quanti- 
ties for systems consisting of two, three, and four 13mers with 
AB sequence SI. 



system 


Tagg 


A(S S urf 




e agg 


Gfrag 


Aq 


Ag/T agg 


2xSl 


0.198 


2.48 


0.10 


-0.34 


0.04 


0.38 


1.92 


3xSl 


0.212 


2.60 


0.07 


-0.40 


0.05 


0.45 


2.12 


4xSl 


0.217 


2.30 


0.04 


-0.43 


0.05 


0.48 


2.21 



also shown and the maximum deviations, i.e., the sur- 
face entropies A<S sur f and relative surface entropies per 
monomer As sur f = AS sur {/N to t are listed in Table Q] 
There is no apparent difference between the values of 
AiS sur f that would indicate a trend for a vanishing of 
the absolute surface barrier in larger systems. However, 
the relative surface entropy As sur f obviously decreases. 
Whether or not it vanishes in the thermodynamic limit 
cannot be decided from our results and is a study worth 
in its own right. 

It is also interesting that subleading effects increase 
and the double- well form found for 2xSl changes by 
higher-order effects, and it seems that for larger systems 
the almost single-step aggregation of 2 x SI is replaced by 
a multiple-step process. 

Not surprisingly, the fragmented phase is hardly in- 
fluenced by side effects and the rightmost minimum in 
Fig. M lies well at e frag = E {Ta , g /N to t ~ 0.04 - 0.05. 
Since the Gibbs construction covers the whole convex 
region of s(e), the aggregation energy per monomer 
e agg = E agg /N to t corresponds to the leftmost minimum 
and its value changes noticeably with the number of 
chains. In consequence, the latent heat per monomer 



Aq = AQ/N tot = T agg [<S(£f rag ) - S(£ agg )] /JVtot that is 
required to fragment the aggregate increases from two 



to four chains in the system (see Table HJ. Although 
the systems under consideration are too small to extrap- 
olate phase transition properties in the thermodynamic 
limit, it is obvious that the aggregation-fragmentation 
transition exhibits strong similarities to condensation- 
evaporation transitions of colloidal systems. Given that, 
the entropic transition barrier A<7/T agg , which we see in- 
creasing with the number of chains (cf. the values in Ta- 
ble HI, would survive in the thermodynamic limit and the 
transition was first-order-like. More surprising would be, 
however, if the convex intruder would not disappear, i.e., 
if the absolute and relative surface entropies A5 sur f and 
As sur f do not vanish. This is definitely a question of fun- 
damental interest as the common claim is that pure sur- 
face effects typically exhibited only by "small" systems 
are irrelevant in the thermodynamic limit. This requires, 
however, studies of much larger systems. It should clearly 
be noted, however, that protein aggregates forming them- 
selves in biological systems often consist only of a few 
peptides and are definitely of small size and the surface 
effects are responsible for structure formation and are 
not unimportant side effects. One should keep in mind 
that standard thermodynamics and the thermodynamic 
limit are somewhat theoretical constructs valid only for 
very large systems. The increasing interest in physical 
properties of small systems, in particular in conforma- 
tional transitions in molecular systems, requires in part 
a revision of dogmatic thermodynamic views. Indeed, 
by means of today's chemo-analytical and experimental 
equipment, effects like those described throughout the 
paper, should actually experimentally be verifiable as 
these are real physical effects. For studies of the conden- 
sation of atoms, where a similar behavior occurs, such 
experiments have actually already been performed Q. 



V. SUMMARY 

In this paper, we have extended the microcanonical 
analysis of the aggregation of an exemplified two-peptide 
system [171 ] by interpreting the results from the multi- 
canonical and the canonical perspective as well. In addi- 
tion, these results are compared with aggregation prop- 
erties of larger systems consisting of three and four pep- 
tides, each of which with the same sequence. From the 
conventional canonical analysis of statistical fluctuations 
of energy and a suitably chosen "order" parameter - the 
root mean square distance of the centers of masses of the 
individual polymers - we obtain the typical small-system 
indications of a thermodynamic phase transition: Sharp 
peaks in the specific heat and in the order parameter 
fluctuations at almost the same temperature signalize a 
strong transition, which we clearly identify as the aggre- 
gation transition. For all systems considered, the general 
behavior is similar. There is only this single transition 
which also indicates that conformational changes of the 
polymers accompany the aggregation process and are not 
separate transitions. We expect that this coincidence is 
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sequence-dependent and a comparison between different 
sequences is a study in its own right. At least for the 
scmiflexible homopolymer of same size which in our no- 
tation would have the sequence A13 (or also B13), we find 
that aggregation and collapse are separate processes [29j . 

A quite remarkable result of the exemplified het- 
eropolymer study presented in this paper is that the 
aggregation process of a small number of peptides is a 
phase-separation process, where interfacial surface effects 
entail a loss of entropy. This loss must be compensated 
by additional energy delivered to the system. In conse- 
quence, the caloric temperature decreases, i.e., the aggre- 
gate is getting colder although its total energy increases. 
This is known as the temperature "backbending" which 
is a real thermodynamic effect and not an artefact of the 
theory. In the systems considered throughout the paper, 
the relative influence of the surface effects seems to de- 
crease with the number of chains in the system. Since the 
length of the peptides is fixed by their hydrophobic-polar 



monomer composition, a thermodynamic limit towards 
infinite chain lengths is, however, not existing. It is just 
the smallness of such molecular systems that allow these 
to trigger biological interchange processes which are in- 
evitably connected with conformational activity. 
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